AIbase

# RL training model

AceReason-Nemotron-14B GGUF
AceReason-Nemotron-14B is a math and code reasoning model trained with reinforcement learning. It performs strongly on multiple math and code reasoning benchmarks.
Large Language Model, Transformers
QuantFactory
326
2
AceReason-Nemotron-7B GGUF
AceReason-Nemotron-7B is a math and code reasoning model trained with reinforcement learning, starting from DeepSeek-R1-Distill-Qwen-7B. It performs strongly on multiple benchmarks.
Large Language Model, Transformers
QuantFactory
326
2
© 2025 AIbase